RAW SEQUENCE LISTING 
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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: Off/to CoO 



Source: 



Date Processed by STIC: 



$72/j2oi>l 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTTN 2.1 e-mail help: patin21help(o),uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(o),uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO - ). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error S™mary 



ft] 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 



Wrapped Nucleics 



The number/text at the end of each line "wrapped" down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



Wrapped Aminos The amino acid number/text at the end of each line "wrapped " down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 



Misaligned Amino Acid 
Numbering 



The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 



Non-ASCII This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Variable Length Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 



Patentln ver. 2.0 "bug" 



A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 



Skipped Sequences 
(NEW RULES) 



Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 



1 0 Use of n's or Xaa's 

(NEW RULES) 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



. 1 1 Use of "Artificial" Use of "Artificial" only as "<21 3> Organism" response is incomplete, per 1 .823(b) of New Sequence Rules. 

(NEW RULES) Valid response is Artificial Sequence. 



12 ' 



13 



Use of <220>Feature 
(NEW RULES) 



Patentln ver. 2.0 "bug" 



Sequence(s) 




, are missing the <220> Feature and associated headings. 



Use of <220> to <223> is MANDATORY if <213>ORGANISM is "Artificial Sequence" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1 .823 of new Rules) 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager" or any other means to copy file to floppy disk. 
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DATE: 05/31/2001 
TIME: 12:12:09 



C- 
C- 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/663,600 

Input Set : D:\SEQLIST\Seqlist.txt 
Output Set: C:\CRF3\05312001\I663600.raw 



4 <110> APPLICANT: Dumas Milne Edwards, Jean-Baptiste 

5 Duclert, Aymeric 

6 Bougueleret, Lydie 
TITLE OF INVENTION: EXTENDED CDNAS FOR SECRETED PROTEINS 
FILE REFERENCE: 31.US3.CIP 

-> 12 <140> CURRENT APPLICATION NUMBER: US/09/663,600 
-> 13 <141> CURRENT FILING DATE: 2000-09-15 

PRIOR APPLICATION NUMBER: 60/066,677 



Does Not Comply 
Corrected Diskette Needed 



8 <120> 
10 <130> 

12 <140> 

13 <141> 

15 <150> 

16 <151> 

18 <150> 

19 <151> 
21 <150> 

<151> 
<150> 
25 <151> 

27 <150> 

28 <151> 

30 <150> 

31 <151> 
33 <160> 
35 <170> 



22 
24 



PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
NUMBER OF SEQ ID NOS : 
SOFTWARE: Patent. pm 



1997-11-13 
NUMBER: 60/069,957 

1997- 12-17 
NUMBER: 60/074,121 

1998- 02-09 
NUMBER: 60/081,563 

1998-04-13 
NUMBER: 60/096,116 

1998-08-10 
NUMBER: 60/099,273 
1998-09-04 
229 



ERRORED SEQUENCES 

12186 <210> SEQ ID NO: 228 

12187 <211> LENGTH: 560 

12188 <212> TYPE: DNA 

12189 <213> ORGANISM: Homo sap 



piens vtJ 



W — > 12190 <220> FEATURE: 

12191 <221> NAME /KEY : 

12192 <222> LOCATION: 
W — > 12193 <220> FEATURE: 

12194 <221> NAME/KEY: 

12195 <222> LOCATION: 
W — > 12196 <220> FEATURE: 

12197 <221> NAME/KEY 

12198 <222> LOCATION 
E — > 12199 <400> SEQUENCE 

12200 
12201 

12202 1 

12203 eta aca gaa aag 

12204 Leu Thr Glu Lys 

12205 15 



CDS 
(11) • • 

polyA_ 
(547)" 



(439) 

site 
'. (560) 



polyA_signal 
(530) . . (535) 

cagaacaatc atg tct gac tec ctg gtg gtg tgc gag gta gac cca gag 
Met Ser Asp Ser Leu Val Val Cys Glu Val Asp Pro Glu 
5 10 
ctg agg aaa ttc cgc ttc cga aaa gag aca gac aat 
Leu Arg Lys Phe Arg Phe Arg Lys Glu Thr Asp Asn 
20 25 



12206 gca gec ate 

12207 Ala Ala He 



ata atg aag gtg gac aaa gac egg cag atg gtg gtg 
He Met Lys Val Asp Lys Asp Arg Gin Met Val Val 



ctg 
Leu 



49 



97 



145 



file://C:\Crf3\Outhold\VsrI663600.htm 




5/31/01 



Please Note: 

| Use of n and/or Xaa have been dete£i in the Sequence Listing. Please revi Ahe 
I Se q ue nce Listing to ensure that a corresponding explanation is presented in We <220> to &A T 
<223> fields of each sequence which presents at least one n or Xaa. 

RAW SEQUENCE LISTING DATE: 05/31/2001 

PATENT APPLICATION: US/09/663,600 TIME: 19:04:05 

Input Set : C:\Jumbos\663600.txt 
Output Set: C:\CRF3\05312001\I663600.raw 

4 <110> APPLICANT: Dumas Milne Edwards, Jean-Baptiste 

5 Duclert, Aymeric 

6 Bougueleret, Lydie 

8 <120> TITLE OF INVENTION: EXTENDED CDNAS FOR SECRETED PROTEINS 
10 <130> FILE REFERENCE: 31.US3.CIP 
C~> 12 <140> CURRENT APPLICATION NUMBER: US/09/663,600 
C~> 13 <141> CURRENT FILING DATE: 2000-09-15 

15 <150> PRIOR APPLICATION NUMBER: 60/066,677 

16 <151> PRIOR FILING DATE: 1997-11-13 

18 <150> PRIOR APPLICATION NUMBER: 60/069,957 

19 <151> PRIOR FILING DATE: 1997-12-17 

21 <150> PRIOR APPLICATION NUMBER: 60/074,121 

22 <151> PRIOR FILING DATE: 1998-02-09 

24 <150> PRIOR APPLICATION NUMBER: 60/081,563 

25 <151> PRIOR FILING DATE: 1998-04-13 

27 <150> PRIOR APPLICATION NUMBER: 60/096,116 

28 <151> PRIOR FILING DATE: 1998-08-10 

30 <150> PRIOR APPLICATION NUMBER: 60/099,273 

31 <151> PRIOR FILING DATE: 1998-09-04 
33 <160> NUMBER OF SEQ ID NOS : 229 
35 <170> SOFTWARE: Patent. pm 

37 <210> SEQ ID NO: 1 

38 <211> LENGTH: 47 

39 <212> TYPE: RNA 

40 <213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: in vitro transcription product 
<220> FEATURE: 

<221> NAME/KEY: modif ied_base 
<222> LOCATION: 1 
<223> OTHER INFORMATION: m7g 
47 <400> SEQUENCE: 1 

ggcauccuac ucccauccaa uuccacccua acuccuccca ucuccac 4 7 

<210> SEQ ID NO: 2 
<211> LENGTH: 46 
<212> TYPE: RNA 

<213> ORGANISM: Artificial Sequence 
,<220> FEATURE: 

<223> OTHER INFORMATION: in vitro transcription product 
<400> SEQUENCE: 2 

gcauccuacu cccauccaau uccacccuaa cuccucccau cuccac 4 6 

<210> SEQ ID NO: 3 
<211> LENGTH: 25 

<212> TYPE: DNA - > A „ „ _/ v -4. A ,J 

<213> ORGANISM: Artificial Se quence ^J^jJj^^^C y toUMJU ^ 





W — > 63 <220> FEATURE: 

64 <223> OTHER INFORMATION 




The types of errors shown exist throughout the Sequence Listing. Please check 
subsequent sequences for similar errors. 



